Dimensionality Reduction for Spectral Clustering
Abstract
Spectral clustering is a flexible clustering methodology that is applicable to a variety of data types and has the particular virtue that it makes few assumptions on cluster shapes. It has become popular in a variety of application areas, particularly in computational vision and bioinformatics. The approach appears, however, to be particularly sensitive to irrelevant and noisy dimensions in the data. We thus introduce an approach that automatically learns the relevant dimensions and spectral clustering simultaneously. We pursue an augmented form of spectral clustering in which an explicit projection operator is incorporated in the relaxed optimization functional. We optimize this functional over both the projection and the spectral embedding. Experiments on simulated and real data show that this approach yields significant improvements in the performance of spectral clustering.
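The alternating optimization described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not give the functional, so everything specific here is an assumption: the Gaussian kernel with bandwidth `sigma`, the objective trace(U^T D^{-1/2} K D^{-1/2} U), the finite-difference gradient, the QR step that keeps the projection `W` orthonormal, and all function names.

```python
import numpy as np

def normalized_affinity(Y, sigma):
    """Return D^{-1/2} K D^{-1/2} for a Gaussian kernel K on the rows of Y."""
    sq_dists = np.sum((Y[:, None, :] - Y[None, :, :]) ** 2, axis=-1)
    K = np.exp(-sq_dists / (2.0 * sigma ** 2))
    d = K.sum(axis=1)
    return K / np.sqrt(np.outer(d, d))

def top_eigenvectors(L, k):
    """Columns are the k eigenvectors of symmetric L with the largest eigenvalues."""
    _, vecs = np.linalg.eigh(L)  # eigh returns eigenvalues in ascending order
    return vecs[:, -k:]

def objective(X, W, U, sigma):
    """Assumed relaxed objective: trace(U^T L(W) U) on the projected data X @ W."""
    return float(np.trace(U.T @ normalized_affinity(X @ W, sigma) @ U))

def numerical_gradient(X, W, U, sigma, eps=1e-5):
    """Central finite-difference gradient of the objective with respect to W."""
    G = np.zeros_like(W)
    for idx in np.ndindex(*W.shape):
        step = np.zeros_like(W)
        step[idx] = eps
        G[idx] = (objective(X, W + step, U, sigma)
                  - objective(X, W - step, U, sigma)) / (2.0 * eps)
    return G

def fit(X, k, q, sigma=1.0, n_iter=20, lr=1.0, seed=0):
    """Alternate between the spectral embedding U and the projection W."""
    rng = np.random.default_rng(seed)
    W, _ = np.linalg.qr(rng.standard_normal((X.shape[1], q)))  # orthonormal start
    history = []
    for _ in range(n_iter):
        # Step 1: with W fixed, the optimal U is the top-k eigenvectors.
        U = top_eigenvectors(normalized_affinity(X @ W, sigma), k)
        current = objective(X, W, U, sigma)
        history.append(current)
        # Step 2: with U fixed, take an ascent step on W, re-orthonormalize
        # with QR, and halve the step until the objective improves.
        G = numerical_gradient(X, W, U, sigma)
        step = lr
        while step > 1e-8:
            W_new, _ = np.linalg.qr(W + step * G)
            if objective(X, W_new, U, sigma) > current:
                W = W_new
                break
            step /= 2.0
    U = top_eigenvectors(normalized_affinity(X @ W, sigma), k)
    history.append(objective(X, W, U, sigma))
    return W, U, history
```

Because each step only accepts changes that do not lower the objective, the values in `history` should be non-decreasing. Cluster labels would then typically be obtained by running k-means on the rows of `U`, which is omitted here to keep the sketch dependency-free.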
Similar resources
Compressive Spectral Clustering - Error Analysis
Compressive spectral clustering combines the distance preserving measurements of compressed sensing with the power of spectral clustering. Our analysis provides rigorous bounds on how small errors in the affinity matrix can affect the spectral coordinates and clusterability. This work generalizes the current perturbation results of two-class spectral clustering to incorporate multiclass cluster...
Imagerank: spectral techniques for structural analysis of image database
Drawing on the correspondence between spectral clustering, spectral dimensionality reduction, and the connections to the Markov Chain theory, we present a novel unified framework for structural analysis of image database using spectral techniques. The framework provides a computationally efficient approach to both clustering and dimensionality reduction, or 2-D visualization. Within this framewo...
Out-of-Sample Extensions for LLE, Isomap, MDS, Eigenmaps, and Spectral Clustering
Several unsupervised learning algorithms based on an eigendecomposition provide either an embedding or a clustering only for given training points, with no straightforward extension for out-of-sample examples short of recomputing eigenvectors. This paper provides a unified framework for extending Local Linear Embedding (LLE), Isomap, Laplacian Eigenmaps, Multi-Dimensional Scaling (for dimension...
Distributed Spectral Dimensionality Reduction for Visualizing Textual Data
We use a Spectral Clustering model to formulate a distributed implementation using SPARK of Laplacian Eigenmaps that we call Distributed Spectral Dimensionality Reduction (DSDR). We evaluate DSDR to visualize conceptual clusters of terms in textual data from 2149 short documents written by online contributors to a State Department website. We compare DSDR with PCA, MultiDimensional Scaling, ISO...
Techniques for Spectral Clustering
Spectral techniques have of late been in the limelight in the machine learning community and have drawn the attention of many serious machine learners. They are being used in a variety of applications like gene clustering, document analysis, image segmentation, dimensionality reduction etc. They are very simple to understand and provide highly accurate results even for difficult clustering problems. ...